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CG14440, Gpdh, Synaptojanin-like homologous proteins involved in the 
regulation of energy homeostasis 



This invention relates to the use of nucleic acid sequences encoding 
CG 14440, Gpdh, or Synaptojanin-like homologous proteins, and the 
polypeptides encoded thereby and to the use thereof in the diagnosis, 

io study, prevention, and treatment of diseases and disorders. .related to 
body- weight regulation, for example, but not limited to, metabolic diseases 
such as obesity as well as related disorders such as eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 

15 cancers of the reproductive organs, and sleep apnea. 

Obesity is one of the most prevalent metabolic disorders in the world. It is 
still a poorly understood human disease that becomes more and more 
relevant for western society. Obesity is defined as an excess of body fat, 

20 frequently resulting in a significant impairment of health. Besides severe 
risks of illness such as diabetes, hypertension and heart disease, 
individuals suffering from obesity are often isolated socially. Human obesity 
is strongly influenced by environmental and genetic factors, whereby the 
environmental influence is often a hurdle for the identification of (human) 

25 obesity genes. Obesity is influenced by genetic, metabolic, biochemical, 
psychological, and behavioral factors. As such, it is a complex disorder 
that must be addressed on several fronts to achieve lasting positive clinical 
outcome. Obese individuals are prone to ailments including: diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia, 

30 dyslipidemia, osteoarthritis, gallstones, cancers of the reproductive organs, 
and sleep apnea. 
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Obesity is not to be considered as a single disorder but a heterogeneous 
group of conditions with (potential) multiple causes. Obesity is also 
characterized by elevated fasting plasma insulin and an exaggerated insulin 
response to oral glucose intake (Koltermann, J. Clin. Invest 65, 1980, 
1 272-1 284) and a clear involvement of obesity in type 2 diabetes mellitus 
can be confirmed (Kopelman, Nature 404, 2000, 635-643). 

Even if several candidate genes have been described which are supposed 
to influence the homeostatic system(s) that regulate body mass/weight, 
like leptin, VCPI, VCPL or the peroxisome proliferatoc-activated 
receptor-gamma co-activator, the distinct molecular mechanisms and/or 
molecules influencing obesity or body weight/body mass regulations are 
not known. 

Therefore, the technical problem underlying the present invention was to 
provide for means and methods for modulating (pathological) metabolic 
conditions influencing body-weight regulation and/or energy homeostatic 
circuits. The solution to said technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to genes with novel functions in 
body-weight regulation, energy homeostasis, metabolism, and obesity. The 
present invention discloses specific genes involved in the regulation of 
body-weight, energy homeostasis, metabolism, and obesity, and thus in 
disorders related thereto such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the 
reproductive organs, and sleep apnea. In particular, the present invention 
describes the human CG14440, GIycerol-3-phosphate dehydrogenase 
(referred to as Gpdh or G3PDH), or Synapfojanin-like homologous genes as 
being involved in those conditions mentioned above. 




The term 'GenBank Accession number' relates to NCBI GenBank database 
entries (Benson et al, Nucleic Acids Res. 28, 2000, 15-18). 

The Drosophila gene with GadFly Accession Number CG1 44-40 encodes for 
5 a protein which is most homologous to the human hypothetical protein 
LOC55565 (GenBank Accession Number NP_060000.1 for the protein, 
NMJ317530 for the cDNA). No functional data are available for these 
proteins in the prior art. 

10 - 

Glycerol-3-phosphate dehydrogenase (G3PDH) catalyzes the following 
reaction: Dihydroxyacetone phosphate, NADH, and H+ react to 
L-glycerol-3-phosphate and NAD + . G3PDH activity has been demonstrated 
in a wide variety of species belonging to animal as well as plant kingdoms 

15 (Lin EX., 1977, Annu Rev Biochem. 46:765-795). Drosophila Gpdh 
encodes a glyceroI-3-phosphate dehydrogenase (NAD •+- ) involved in 
glycerophosphate shuttle in the cytoplasm. N AD-dependent 
glycerol-3-phosphate dehydrogenase (EC 1 .1 .1 .8) is a cytoplasmic protein, 
active as a homodimer (Von Kalm L. et al., 1989, Proc. Natl. Acad. Sci. 

20 U.S.A. 86:5020-5024) each monomer containing an N-terminal NAD 
binding site (Otto J. et al., 1980, Eur. J. Biochem. 109: 325-330). It is 
necessary for the transfer of the H + of the NADH - generated in the 
glycolysis - from the cytoplasm into the mitochondria, where it is used in 
the respiratory chain. In insects, it acts in conjunction with a mitochondrial 

25 alpha-glycerophosphate oxidase in the alpha-glycerophosphate cycle, 
which is essential for the production of energy used in insect flight. G3PDH 
catalyzes the reversible reduction of dihydroxyacetone phosphate (with 2 
H + ) to gIycerol-3-phosphate, which can diffuse in the mitochondria, where 
reoxidation regenerates the products and leads to FADH2 (2 ATP). 

30 

Comparisons of physicochemical characteristics, tissue distribution and 
coenzyme requirement point to a separate genetic determination and low 



level of evolutionary relatedness between mitochondrial 
g lycerol-3-phosphate dehydrogenase (GPDM) and its cytosolic counterpart 
(GPDS) (Shaw M.A. et al., 1982, Ann Hum Genet 46(Pt 1):1 1-23). 

A. marked reduction in G3PDH activity was observed in chronic plaques of 
multiple sclerosis (Hirsch H.E. et al, 1980, J. Neurochem. 34(3):760-762). 
Insulin exposure . stimulates an increase in G3PDH activity in human 
lymphocytes that correlates to an increase in G3PDH mRNA. The insulin 
stimulation of G3PDH activity is decreased in obese individuals. Treatment 
of those patients by a very low calorie? diet restores insulin stimulation of 
C3 3PDH activity. Thus, insulin stimulation of G3PDH activity is dependent 
upon the metabolic state of the subject from whom the cells are obtained 
(Tu et af., 1995, Biochem Biophys Res Commun 207(1 ):1 83-1 90). 
However, although a stimulation of G3PDH by insulin was described in the 
prior art, it has not been described in the prior art that G3Pt)H is directly 
involved in the regulation of energy homoestasis and thus involved in the 
storage of triglycerides. 

Synaptic vesicles are recycled with remarkable speed and precision in 
nerve terminals. A major recycling pathway involves ctathrin-mediated 
endocytosis at endocytic zones located around sites of release. Different 
'accessory' proteins linked to this pathway have been shown to alter the 
shape and composition of lipid membranes, to modify membrane-coat 
protein interactions, and to influence actin polymerization. These include 
th e GTPase dynamin, the lysophosphatidic acid acyl transferase endophilin, 
and the phosphoinositide phosphatase synaptojanin (Brodin L. et al., 2O00, 
Curr Opin Neurobiol 1 0(3) :31 2-320). Studies on the endocytosis of 
synaptic vesicles have shown the essential roles of endophilin and 
synaptojanin in vesicle formation {see, Ringstad N. et al., 1999, Neuron 
24(1): 143-1 54). The recessive suppressor of secretory defect in yeast 
Golgi and yeast actin function belongs to this family (Luo W. and Chang 
A., 1997, J Cell Biol 1 38(4) :73 1-746). This protein may be involved in the 




coordination of the activities of the secretory pathway and the actin 
cytoskeleton. Human synaptojanin, which may be localised on coated 
endocytic intermediates in nerve terminals also belongs to this family 
(Haffner C. et ah, 1997, FEBS Lett 41 9(2-3):1 75-1 80). Studies on the 
5 endocytosis of synaptic vesicles have shown the essential roles of 
endophilin and synaptojanin in vesicle formation (see, Ringstad N. et al., 
1999 r Neuron 24(1): 143-1 54). 

So far, it has not been described that CG1 4440, Gpdh, or Synaptojanin-like 
10 and homologous proteins are involved in the regulation of . energy 
homeostasis and body-weight regulation and related disorders, and thus, 
no functions in metabolic diseases and other diseases as listed above have 
been discussed. In this invention we demonstrate that the correct gene 
dose of CG 14440, Gpdh, or Synaptojanin-like is essential for maintenance 
is of energy homeostasis. A genetic screen was used to identify that 
mutation of CG 1 4440, Gpdh, or Synaptojanin-like homologous genes cause 
obesity, reflected by a significant change of triglyceride content, the major 
energy storage substance. 

20 Before the present proteins, nucleotide sequences, and methods are 
described, it is understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors, and reagents described as 
these may vary. It is also to be understood that the terminology used 
herein is for the purpose of describing particular embodiments only, and is 

25 not intended to limit the scope of the present invention that will be limited 
only by the appended claims. Unless defined otherwise, all technical and 
scientific terms used herein have the same meanings as commonly 
understood by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or equivalent to those 

30 described herein can be used in the practice or testing of the present 
invention, the preferred methods, devices, and materials are now 
described. All publications mentioned herein are incorporated herein by 
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reference for the purpose of describing and disclosing the cell lines, 
vectors, and methodologies that are reported in the publications which 
might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate 
5 such disclosure. 

The present invention discloses that CG 14440, Gpdh, or Synaptojanin-like 
homologous proteins are regulating the energy homeostasis and fat 
metabolism especially the metabolism and storage of triglycerides, and 

o polynucleotides, which identify and encode the proteins disclosed in this 
invention. The invention also relates to vectors, host cells, antibodies, and 
recombinant methods for producing the polypeptides and polynucleotides 
of the invention. The invention also relates to the use of these sequences 
in the diagnosis, study, prevention, and treatment of diseases and 

5 disorders, for example, but not limited to, metabolic diseases such as 
obesity as well as related disorders such as eating disorder, cachexia, 
diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. 

> 

CG 14440, Gpdh, or Synaptojanin-like homologous proteins and nucleic 
acid molecules coding therefore are obtainable from insect or vertebrate 
species, e.g. mammals or birds. Particularly preferred are homologous 
nucleic acids, particularly nucleic acids encoding the human hypothetical 
protein LOC55565, human soluble glycerol-3-phosphate dehydrogenase 1 , 
or human KIAA0966 protein. 

The invention particularly relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
metabolism of triglycerides, wherein said nucleic acid molecule comprises 
(a) the nucleotide sequence of (i) CG 14440 {GadFly Accession 
Number), or encoding the human hypothetical protein LOC55565, 



(ii) Gpdh (Glycerol 3 phosphate dehydrogenase, GadFly Accession 
Number CG9042), or encoding the human soluble 
glycerol-3-phosphate dehydrogenase 1, or (iii) CG7956 (GadFly 
Accession Number), or encoding human KIAA0966 protein, and/or 
a sequence complementary thereto, 

(b) a nucleotide sequence which hybridizes at 50°C in a solution 
containing 1 x SSC and 0.1 % SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

(d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more 
preferably at least 98% and up to 99,6% identical to the amino acid 
sequences of CG 14440, Gpdh, or Synaptojanin-Iike protein, 
preferably of the human hypothetical protein L0C55 565, human 
soluble glycerol-3-phosphate dehydrogenase 1, or human KIAA0966 
protein, 

(e) a sequence which differs from the nucleic acid molecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration, 
deletion, duplication and/or premature stop in the encoded 
polypeptide or 

(f) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of at least 15 bases, preferably at least 20 bases, 
more preferably at least 25 bases and most preferably at least 50 
bases. 

The present invention relates to genes with novel functions in body-weight 
regulation, energy homeostasis, metabolism, and obesity. The invention is 
based on the finding that CG 14440, Gpdh, or Synsptojanin-like 
homologous proteins (herein referred to as CG14440, Gpdh, or 
Synaptojanin-Iike) and the polynucleotides encoding these, are involved in 
the regulation of triglyceride storage and therefore energy homeostasis. To 1 
find genes with novel functions in energy homeostasis, metabolism, and 



obesity, a functional genetic screen was performed with the model 
organism Drosophila melanogaster (Meigen). One resource for screening 
was a Drosophila melanogaster stock collection of EP-Iines. The P-vector of 
this collection has Gal4-UAS~binding sites fused to a basal promoter that 
can transcribe adjacent genomic Drosophila sequences upon binding of 
Gal4 to UAS-sites. This enables the EP-line collection for overexpression of 
endogenous flanking gene sequences. In addition, without activation of the 
UAS-sites, integration of the EP-element into the gene is likely to cause a 
reduction of gene activity, and allows determining its function by 
evaluating the loss-of-f unction phenotype. 

Triglycerides are the most efficient storage for energy in cells, and obese 
patients mainly show a significant increase in the content of triglycerides. 
In order to isolate genes with a function in energy homeostasis, several 
thousand EP-Iines were tested for their triglyceride content after a 
prolonged feeding period (see Examples for more detail). Lines with 
significantly changed triglyceride content were selected as positive 
candidates for further analysis. The increase or decrease of triglyceride 
content due to the loss of a gene function suggests gene activities in 
energy homeostasis in a dose dependent manner that controls the amount 
of energy stored as triglycerides. 

In this invention, the content of triglycerides of a pool of flies with the 
same genotype after feeding for six days was analyzed using a triglyceride 
assay. Male flies homozygous for the integration of vectors for Drosophila 
lines PX10162.1, HD-EP(2)21 956, or HD-EP{3)31 805 were analyzed in an 
assay measuring the triglyceride contents of these flies, illustrated in more 
detail in the EXAMPLES section. The results of the triglyceride content 
analysis are shown in FIGURES 1 , 4 and 7, respectively. 

Genomic DNA sequences were isolated that are localized to the EP vector- 
(herein PX10162.1, HD-EP(2)21 956, or HD-EP(3)31 805) integration. Using 



those isolated genomic sequences public databases like Berkeley 
Drosophila Genome Project (GadFly) were screened thereby identifying the 
integration site of the vectors, and the corresponding genes, described in 
more detail in the EXAMPLES section. The molecular organization of the 
5 genes is shown in FIGURES 2, 5 and 8, respectively. 

The present invention further describes polypeptides comprising the amino 
acid sequences of CG 14440, Gpdh, or Synaptojanin-like and homologous 
proteins. Based upon homology, the proteins of the invention and each 
10 homologous protein or peptide may share at least some activity. No 
functional data described the regulation of body weight control and related 
metabolic diseases are available in the prior art for the genes of the 
invention. 

15 The invention also encompasses polynucleotides that encode CG 14440, 
Gpdh, or Synaptojanin-like and homologous proteins. Accordingly, any 
nucleic acid sequence, which encodes the amino acid sequences of 
CGI 4440, Gpdh, or Synaptojanin-like and homologous proteins, can be 
used to generate recombinant molecules that express CG1 4440, Gpdh, or 

20 Synaptojanin-like and homologous proteins. In a particular embodiment, the 
invention encompasses a nucleic acid encoding CG 14440 (GadFly 
Accession Number), the human hypothetical protein LOC55565, Gpdh 
(Glycerol 3 phosphate dehydrogenase, GadFly Accession Number 
CG9042), the human soluble glycerol-3-phosphate dehydrogenase 1, 

25 CG7956 (GadFly Accession Number), or the human KIAA0966 protein. It 
will be appreciated by those skilled in the art that as a result of the 
degeneracy of the genetic code, a multitude of nucleotide sequences 
encoding the proteins, some bearing minimal homology to the nucleotide 
sequences of any known and naturally occurring gene, may be produced. 

30 Thus, the invention contemplates each and every possible variation of 
nucleotide sequence that could be made by selecting combinations based 
on possible codon choices. These combinations are made in accordance 



with the standard triplet genetic code as applied to the nucleotide 
sequences of naturally occurring CG 14440, Gpdh, or Synaptojanin-like and 
homologous proteins, and all such variations are to be considered as being 
specifically disclosed. Although nucleotide sequences, which encode the 
proteins, and their variants are preferably capable of hybridizing to the 
nucleotide sequences of the naturally occurring proteins under 
appropriately selected conditions of stringency, it may be advantageous to 
produce nucleotide sequences encoding the proteins or their derivatives 
possessing a substantially different codon usage. Codons may be selected 
to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency 
with which particular codons are utilized by the host- Other reasons for 
substantially altering the nucleotide sequence encoding the proteins and 
their derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, 
such as a greater half-life, than transcripts produced from the naturally 
occurring sequences. The invention also encompasses production of DNA 
sequences or portions thereof, which encode the proteins and their 
derivatives, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression 
vectors and cell systems using reagents that are well known in the art at 
the time of the filing of this application. Moreover, synthetic chemistry may 
be used to introduce mutations into a sequence encoding the protein or 
any portion thereof. 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, and in 
particular, those of the polynucleotide encoding CG14440 (GadFly 
Accession Number), the human hypothetical protein LOC55565, Gpdh 
(Glycerol 3 phosphate dehydrogenase, GadFly Accession Number 
CG9042), the human soluble glycerol-3-phosphate dehydrogenase 1, 
CG7956 (GadFly Accession Number), or the human KIAA0966 protein, 



under various conditions of stringency. Hybridization conditions are based 
on the melting temperature (Tm) of the nucleic acid binding complex or 
probe, as taught in Wahl, G. M. and S. L. Berger (1 987: Methods Enzymol. 
152:399-407) and Kimmel, A. R. (1987; Methods Enzymol. 152:507-51 1), 
and may be used at a defined stringency. Preferably, hybridization under 
stringent conditions means that after washing for 1 h with 1 x SSC and 
0.1 % SDS at 50°C, preferably at 55°C / more preferably at 62°C and most 
preferably at 68°C, particularly for 1 h in 0.2 x SSC and 0.1% SDS at 
50°C / preferably at 55 °C, more preferably at 62°C and most preferably at 
68°C, a positive hybridization signal is observed. Altered nucleic acid 
sequences encoding the proteins which are encompassed by the invention 
include deletions, insertions or substitutions of different nucleotides 
resulting in a polynucleotide that encodes the same or a functionally 
equivalent protein. 

The encoded proteins nnay also contain deletions, insertions or 
substitutions of amino acid residues, which produce a silent change and 
result in functionally equivalent proteins. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the biological activity of the protein is retained. 

Also included within the scope of the present invention are alleles of the 
genes encoding CGI 4440, Gpdh, or Synaptojanin-like and homologous 
proteins. As used herein, an 'allele' or 'allelic sequence' is an alternative 
form of the gene, which may result from at least one mutation in the 
nucleic acid sequence. Alleles may result in altered mRNAs or polypeptides 
whose structures or function may or may not be altered. Any given gene 
may have none, one or many allelic forms. Common mutational changes, 
which give rise to alleles, are generally ascribed to natural deletions, 
additions or substitutions of nucleotides. Each of these types of changes 
may occur alone or in combination with the others, one or more times in a 
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given sequence. Methods for DNA sequencing which are well known and 
generally available in the art may be used to practice any embodiments of 
the invention. 

The nucleic acid sequences encoding CG1 4440, Gpdh, or Synaptojan in-like 
and homologous proteins may be extended utilizing a partial nucleotide 
sequence and employing various methods known in the art to detect 
upstream sequences such as promoters and regulatory elements. For 
example, one method which may be employed, 'restriction-site' PCR, uses 
universal primers to retrieve unknown sequence adjacent to a known locus 
(Sarkar, G. (1 993) PCR Methods Applic. 2:31 8-322). Inverse PCR may also 
be used to amplify or extend sequences using divergent primers based on 
a known region (Triglia/T. et al. (1 988) Nucleic Acids Res. 1 6:81 86). The 
primers may be designed using OLIGO 4.06 primer analysis software 
(National Biosciences Inc., Plymouth, Minn.) or another appropriate 
program, to 22-30 nucleotides in length, to have a GC content of 50% or 
more, and to anneal to the target sequence at temperatures about 
68-72 °C. The method uses several restriction enzymes to generate 
suitable fragments. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 

Another method which may be used is capture PCR which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human 
and yeast artificial chromosome DNA (Lagerstrom, M. et al. (PCR Methods 
Applic. 1:1 11-119). In this method, multiple restriction enzyme digestions 
and ligations also are used to place an engineered double-stranded 
sequence into an unknown portion of the DNA molecule before performing 
PCR. 

Another method which may be used to retrieve unknown sequences is that 
of Parker, J. D. et al. (1991; Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER 
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libraries to walk in genomic DNA {Clontech, Palo Alto, Calif.). This process 
avoids the need to screen libraries and is useful in finding intron/exon 
junctions. 

When screening for full-length cDNAs, it is preferable to use libraries that 
have been size-selected to include larger cDNAs. Also, random-primed 
libraries are preferable, in that they will contain more sequences, which 
contain the 5' regions of genes. Use of a randomly primed library may be 
especially preferable for situations in which an oligo d{T) library does not 
yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into the 5' and 3' non-transcribed regulatory regions. Capillary 
electrophoresis systems, which are commercially available, may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR 
products. In particular, capillary sequencing may employ flowable polymers 
for electrophoretic separation, four different fluorescent dyes (one for each 
nucleotide) which are laser activated, and detection of the emitted 
wavelengths by a charge coupled devise camera. Output/light intensity 
may be converted to electrical signal using appropriate software (e.g. 
GENOTYPER and SEQUENCE NAVIGATOR, Perkin Elmer) and the entire 
process from loading of samples to computer analysis and electronic data 
display may be computer controlled. Capillary electrophoresis is especially 
preferable for the sequencing of small pieces of DNA, which might be 
present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or 
fragments thereof which encode CG14440, Gpdh, or Synaptojanin-like and 
homologous proteins or fusion proteins or functional equivalents thereof, 
may be used in recombinant DNA molecules to direct expression of the 
proteins in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other DNA sequences, which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and 
these sequences may be used to clone and express the proteins. As will be 



understood by those of skill in the art, it may be advantageous to produce 
protein-encoding nucleotide sequences possessing non-naturally occurring 
codons. For example, codons preferred by a particular prokaryotic or 
eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, 
such as a half-life which is longer than that of a transcript generated from 
the naturally occurring sequence. The nucleotide sequences of the present 
invention can be engineered using methods generally known in the art in 
order to alter protein-encoding sequences for a variety of reasons, 
including but not limited to, alterations which modify the cloning, 
processing, and/or expression of the gene product. DNA shuffling by 
random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. For example, site-directed mutagenesis may be used to insert 
new restriction sites, alter glycosylation patterns, change codon 
preference, produce splice variants or introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified or recombinant 
nucleic acid sequences encoding CGI 4-440, Gpdh, or Synaptojanin-Iikeand 
homologous proteins may be ligated to a heterologous sequence to encode 
a fusion protein. For example, to screen libraries, e.g. peptide libraries or 
low-molecular weight compound libraries for inhibitors of CG 14440, Gpdh, 
or Synaptojanin-like and homologous protein activities, it may be useful to 
encode chimeric proteins that can be recognized by a commercially 
available antibodies. A fusion protein may also be engineered to contain a 
cleavage site located between the desired protein-encoding sequence and 
the heterologous protein sequence so that the desired protein may be 
cleaved and purified away from the heterologous moiety. In another 
embodiment, sequences encoding the protein may be synthesized, in 
whole or in part, using chemical methods well known in the art (see 
Caruthers, M. H. et al. (1980) NucL Acids Res. Symp. Ser. 7:215-223, 
Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 7:225-232:). 
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Alternatively, the proteins themselves may be produced using chemical 
methods to synthesize the amino acid sequence of the protein or a portion 
thereof. For example, peptide synthesis can be performed using various 
solid-phase techniques (Roberge, J. Y. et al. (1 995) Science 269:202-204) 
5 and automated synthesis may be achieved, for example, using the ABI 
431 A peptide synthesizer (Perkin Elmer). The newly synthesized peptide 
may be substantially purified by preparative high performance liquid 
chromatography (e.g., Creighton, T. (1983) Proteins, Structures and 
Molecular Principles, WH Freeman and Co., New York, N.Y.). The 

10 composition of the synthetic peptides may be confirmed by amino acid 
analysis or sequencing (e.g., the Edman degradation procedure; Creighton, 
supra). Additionally, the amino acid sequences of the proteins or any part 
thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins or any part thereof, 

15 to produce a variant polypeptide. 

In order to express a biologically active protein, the nucleotide sequences 
encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector, which contains the 
20 necessary elements for the transcription and translation of the inserted 
coding sequence. Methods, which are well known to those skilled in the 
art, may be used to construct expression vectors containing sequences 
encoding the proteins and sppropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA 

r 

25 techniques, synthetic techniques, and in vivo genetic recombination. Such 
techniques are described in Sambrook, J. et al. (1 989) Molecular Cloning, 
A Laboratory Manual, Cold Spring Harbor Press, Plain view, N.Y., and 
Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John 
Wiley & Sons, New York, N.Y. 

30 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding the proteins. These include, but are not 



10 
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limited to, micro-organisms such as bacteria transformed with recombinant 
bacteriophage, plasmid or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected 
with virus expression vectors (e.g., baculovirus); plant cell systems 
transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or PBR322 plasmids); or animal cell systems. The "control 
elements" or "regulatory sequences" are those non-translated regions of 
the vector-enhancers, promoters, 5' and 3' untranslated regions which 
interact with host cellular proteins to carry out transcription and 
translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable 
transcription and translation elements, including constitutive and inducible 
promoters, may be used. For example, when cloning in bacterial systems, 
inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT 
phagemid (Stratagene, LaJolla, Calif.) or PSPORT1 plasmid (Gibco BRL) and 
the like may be used. The baculovirus polyhedrin promoter may be used in 
insect cells. Promoters and enhancers derived from the genomes of plant 
cells (e.g., heat shock, RUBISCO; and storage protein genes) or from plant 
viruses (e.g., viral promoters and leader sequences) may be cloned into the 
vector. In mammalian cell systems, promoters from mammalian genes or 
from mammalian viruses are preferable. If it is necessary to generate a cell 
line that contains multiple copies of the sequences encoding the protein, 
vectors based on SV40 or EBV may be used with an appropriate selectable 
25 marker. 
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30 



In bacterial systems, a number of expression vectors may be selected 
depending upon the use intended for the protein. For example, when large 
quantities of protein are needed for the induction of antibodies, vectors, 
which direct high level expression of fusion proteins that are readily 
purified, may be used. Such vectors include, but are not limited to, the 
multifunctional E. coli cloning and expression vectors such as the 
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BLUESCRIPT phagemid (Stratagene), in which the sequences encoding the 
protein may be ligated into the vector in frame with sequences for the 
amino-terminal Met and the subsequent 7 residues of B-galactosidase so 
that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. M. 
Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. PGEX 
vectors (Promega, Madison, Wis.) may also be used to express foreign 
polypeptides as fusion proteins with Glutathione S-Transferase (GST). In 
general, such fusion proteins are soluble and can easily be purified from 
lysed cells by adsorption to glutathione-agarose beads followed by elution 
in the presence of free glutathione. Proteins made in such systems may be 
designed to include heparin, thrombin or factor XA protease cleavage sites 
so that the cloned polypeptide of interest can be released from the GST 
moiety at will. In the yeast, Saccharomyces cerevisiae, a number of 
vectors containing constitutive or inducible promoters such as alpha factor, 
alcohol oxidase, and PGH may be used. For reviews, see Ausubel et al., 
(supra) and Grantet al. (1987) Methods EnzymoL 153:516-544. 



In cases where plant expression vectors are used, the expression of 
sequences encoding the proteins may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S 
promoters of CaMV may be used alone or in combination with the omega 
leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311). 
Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, 
J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs 
can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection. Such techniques are described in a 
number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1 992) 
McGraw Hill, New York, N.Y.; pp. 191-196). 
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An insect system may also be used to express the proteins. For example, 
in one such system, Autographa californica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera 
frugiperda cells or in Trichoplusia larvae. The sequences encoding the 
protein may be cloned into a non-essential region of the virus, such as the 
polyhedrin gene, and place under control of the polyhedrin promoter. 
Successful insertions of the protein will render the polyhedrin gene inactive 
and produce recombinant virus lacking coat protein. The recombinant 
viruses may then be used to infect, for example, S, frugiperda cells of 
Trichoplusia larvae in which CG 14440, Gpdh, or Synaptojanin-like and 
homologous proteins may be expressed (Engelhard, E. K. et al. (1994) 
Proc. Nat. Acad. Sci. 91:3224-3227). 

In mammalian host cells, a number of viral-based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, 
sequences encoding the protein may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a non-essential E1 or E3 region of 
the viral genome may be used to obtain viable viruses which are capable of 
expressing the protein in infected host cells {Logan, J. and Shenk, T. 
(1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, transcription 
enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding the protein. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences 
encoding the protein, its initiation codons, and upstream sequences are 
inserted into the appropriate expression vector, no additional transcriptional 
or translational control signals may be needed. However, in cases where 
only coding sequence or a portion thereof, is inserted, exogenous 
translational control signals including the ATG initiation codon should be 
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provided. Furthermore, the initiation codon should be in the correct reading 
frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and 
synthetic. The efficiency of expression may be enhanced by the inclusion 
of enhancers which are appropriate for the particular cell system which is 
used, such as those described in the literature (Scharf, D. et al. (1994) 
Results Probl. Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate the 
expression of the inserted sequences or to process the expressed protein 
in the desired fashion. Such modifications of the polypeptide include, but 
are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-tra relational processing 
which cleaves a 'prepro' form of the protein may also be used to facilitate 
correct insertion, folding and/or function. Different host cells such as CHO, 
HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may 
be chosen to ensure the correct modification and processing of the foreign 
protein-, . . . 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. For example, celi lines which stably express 
CG 14440, Gpdh, or Synaptojanin-like and homologous proteins may be 
generated by transformation using expression vectors which may contain 
viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for 1-2 days in an 
enriched media before they are switched to selective media. The purpose 
of the selectable marker is to confer resistance to selection, and its 
presence allows growth and recovery of cells, which successfully express 
the introduced sequences. Resistant clones of stably transformed cells may 
be proliferated using tissue culture techniques appropriate to the cell type. 
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Any number of selection systems may be used to recover transformed cell 
lines. These include, but are not limited to, the herpes simplex virus 
thymidine kinase (Wigler, M. et al. (1977) Cell 1 1:223-32) and adenine 
phosphoribosyltransferase (Lowy, I. et al. (1980) Cell 22:817-23) genes, 
which can be employed in tk-or aprt-,cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the basis 
for selection; for example, dhfr which confers resistance to methotrexate 
(Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which 
confers resistance to the aminoglycosides neomycin and G-4-1 8 
(Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14) and als or pat, 
which confer resistance to chlorsulfuron and phosphinotricin 
acetyltransferase, respectively (Murry, supra). Additional selectable genes 
have been described, for example, trpB, which allows cells to utilise indole 
in place of tryptophan or hisD, which allows cells to utilise histinol in place 
of histidine (Hartman, S. C. and R. C. Mulligan (1 988) Proc. Natl! Acad. 
Sci. 85:8047-51). Recently, the use of visible markers has gained 
popularity with such markers as anthocyanins, IS- glucuronidase and its 
substrate GUS, and luciferase and its substrate luciferin, being widely used 
not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector 
system (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need 
to be confirmed. For example, if the sequences encoding the protein of 
interest are inserted within a marker gene sequence, recombinant cells 
containing sequences encoding the protein can be identified by the 
absence of marker gene function. Alternatively, a marker gene can be 
placed in tandem with sequences encoding the protein under the control of 
a single promoter. Expression of the marker gene in response to induction 
or selection usually indicates expression of the tandem gene as well. 
Alternatively, host cells, which contain and express the nucleic acid 
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sequences encoding the protein may be identified by a variety of 
procedures known to those of skill in the art. These procedures include, 
but are not limited to, DNA-DNA or DNA-RNA hybridization and protein 
bioassay or immunoassay techniques that include membrane, solution or 
chip based technologies for the detection and/or quantification of nucleic 
acid or protein. 

The presence of polynucleotide sequences encoding CG1 44-40, Gpdh, or 
Synaptojanin-like and homologous proteins can be detected by DNA-DNA 
or DNA-RNA hybridization or amplification using probes or portions or 
fragments of polynucleotides encoding CG14440, Gpdh, or 
Synaptojanin-like and homologous proteins. Nucleic acid amplification 
based assays involve the use of oligonucleotides or oligomers based on the 
sequences specific for the gene to detect transformants containing DNA or 
RNA encoding the corresponding protein. As used herein 'oligonucleotides' 
or 'oligomers' refer to a nucleic acid sequence of at least about 10 
nucleotides and as many as about 60 nucleotides, preferably about 15 to 
30 nucleotides, and more preferably about 20-25 nucleotides, which can 
be used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression of 
proteins, using either polyclonal or monoclonal antibodies specific for the 
protein are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes on 
the protein is preferred, but a competitive binding assay may be employed. 
These and other assays are described, among other places, in Hampton, R. 
et al. (1990; Serological Methods, a Laboratory Manual, APS Press, St 
Paul, Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 
158:1211-1216). 
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A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PCR probes for 
detecting sequences related to polynucleotides encoding CG14440, Gpdh, 
or Synaptojanin-like and homologous proteins include oligo-labeling, nick 
translation, end-labeling or PCR amplification using a labeled nucleotide. 

Alternatively, the sequences encoding the protein or any portions thereof 
may be cloned into a vector for the production of an mRNA probe. Such 
vectors are known in the art, are commercially available, and maybe used 
to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase such as 17, T3 or SP6 and labeled nucleotides. These 
procedures may be conducted using a variety of commercially available kits 
(Pharmacia & Upjohn, (Kalamazoo, Mich.); Promega (Madison Wis.); and 
U.S. Biochemical Corp., (Cleveland, Ohio). 

Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, arid 
the like. 

Host cells transformed with nucleotide sequences encoding the protein 
may be cultured under conditions suitable for the expression and recovery 
of the protein from cell culture. The protein produced by a recombinant cell 
may be secreted or contained intracellular^ depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, 
expression vectors containing polynucleotides which encode the protein 
may be designed to contain signal sequences, which direct secretion of the 
protein through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding the 
protein to nucleotide sequence encoding a polypeptide domain, which will 
facilitate purification of soluble proteins. Such purification facilitating 
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domains include, but are not limited to, metal chelating peptides such as 
histidine-tryptophan modules that allow purification on immobilized metals, 
protein A domains that allow purification on immobilized immunoglobulin, 
and the domain utilized in the FLAG extension/affinity purification system 
(Immunex Corp., Seattle, Wash.) The inclusion of cleavable linker 
sequences such as those specific for Factor XA or Enterokinase 
(Invitrogen, San Diego, Calif.) between the purification domain and the 
desired protein may be used to facilitate purification. One such expression 
vector provides for expression of a fusion protein containing the desired 
protein and a nucleic acid encoding 6 histidine residues preceding a 
thioredoxine or an Enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilised metal ion affinity 
chromatography as described in Porath, J. etal. (1992, Prot. Exp. Purif. 3: 
263-281)) while the Enterokinase cleavage site provides a means for 
purifying the desired protein from the fusion protein. A discussion of 
vectors which are suitable for the production of fusion proteins is provided 
in Kroll, D. J. et al. (1993; DNA Ceil Biol. 12:441-45 3). In addition to 
recombinant production, fragments of the proteins may be produced by 
direct peptide synthesis using solid-phase techniques (Merrifield J. (1963) 
J. Am. Chem. Soc. 85:2149-2154). Protein synthesis may be performed 
using manual techniques or by automation. Automated synthesis may be 
achieved, for example, using Applied Biosystems 431 A peptide synthesizer 
(Perkin Elmer). Various fragments of the proteins may be chemically 
synthesized separately and combined using chemical methods to produce 
the full length molecule. 

Diagnostics and Therapeutics 

The data disclosed in this invention show that the nucleic acids and 
proteins of the invention are useful in diagnostic and therapeutic 
applications implicated, for example but not limited to, in metabolic 
disorders such as obesity as well as related disorders such as eating 
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disorder, cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. Hence, diagnostic 
and therapeutic uses for the CG14440, Gpdh, or Synaptojanin-like nucleic 
acids and proteins of the invention are, for example but not limited to, the 
following: (i) protein therapeutic, (ii) small molecule drug target, (Hi) 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), 
(iv) diagnostic and/or prognostic marker, (v) gene therapy (gene 
delivery/gene ablation), (vi) research tools, and (vii) tissue regeneration in 
vitro and in vivo (regeneration for all these tissues and cell types 
composing these tissues and cell types derived from these tissues). 

The nucleic acids and proteins of the invention are useful in diagnostic and 
therapeutic applications implicated in various applications as described 
below. For example, but not limited to, cDNAs encoding the CG14440, 
Gpdh, or Synaptojanin-like proteins of the invention and particularly their 
human homologues may be useful in gene therapy, and the CG14440, 
Gpdh, or Synaptojanin-like proteins of the invention and particularly their 
human homologues may be useful when administered to a subject in need 
thereof. By way of non-limiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from, for 
example, but not limited to, in metabolic disorders as described above. 

The novel nucleic acid encoding the CG 1 4440, Gpdh, or Synaptojanin-like 
protein of the invention or homologous proteins or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acids or the proteins are to be assessed. These 
materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 
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For example, in one aspect, antibodies which are specific for CG14440, 
Gpdh, or Synaptojanin-like and homologous proteins may be used directly 
as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissue which express the 
protein. The antibodies may be generated using methods that are well 
known in the art. Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, chimerical, single chain. Fab fragments, and 
fragments produced by a Fab expression library. Neutralising antibodies, 
(i.e., those which inhibit dimer formation) are especially preferred for 
therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, 
• rats, mice, humans, and others, may be immunized by injection with the 
protein or any fragment or oligopeptide thereof which has immunogenic 
properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminium hydroxide, and surface 
active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil. emulsions,. Jcey hole limpet hemocyanin, _and dinitrophenol. . 
Among adjuvants used in human, BCG (BaciHe Calmette-Guerin) and 
Corynebacterium parvum are especially preferable. It is preferred that the 
peptides, fragments or oligopeptides used to induce antibodies to the 
protein have an amino acid sequence consisting of at least five amino 
acids, and more preferably at least 10 amino acids. It is preferable that 
they are identical to a portion of the amino acid sequence of the natural 
protein, and they may contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of CG 14440, Gpdh, or 
Synaptojanin-like and homologous protein amino acids may be fused with 
those of another protein such as keyhole limpet hemocyanin in order to 
increase the immunogenicity. 



Monoclonal antibodies to the proteins may be prepared using any 
technique that provides for the production of antibody molecules by 
continuous cell lines in culture. These include, but are not limited to, the 
hybridoma technique, the human B-cell hybridoma technique, and the 
EBV-hybridoma technique (Kdhler, G. et al. (1975) Nature 256:495-497; 
Kozbor, D. et al. (1985) J. Immunol. Methods 81 :31-42; Cote, R. J. et al. 
Proc. Natl. Acad. Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell 
Biol. 62:109-120). 
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In addition, techniques developed for the production of 'chimeric 
antibodies', the splicing of mouse antibody genes to human antibody genes 
to~ obtain a molecule with appropriate antigen specificity and biological 
activity can be used (Morrison, S. L. et al. (1984) Proc. Natl. Acad. Sci. 
81:6851-6855; Neuberger, M. S. et al (1984) Nature 312:604-608; 
Takeda, S. et al. (1985) Nature 314:452-454). Alternatively, techniques 
described for the production of single chain antibodies may be adapted, 
using methods known in the art, to produce single chain antibodies 
specific for CG14440, Gpdh, or Synaptojanin-like and homologous 
proteins. Antibodies with related specificity, but of distinct idiotypic 
composition, may be generated by chain shuffling from random 
combinatorial immunoglobulin libraries (Burton, D. R. (1991) Proc. Natl. 
Acad. Sci. 88:1 1 120-3). Antibodies may also be produced by inducing in 
vivo production in the lymphocyte population or by screening recombinant 
immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for the proteins 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab') 2 fragments which can be produced by Pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of F(ab') 2 fragments. 
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Alternatively, Fab expression libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse, W. D. et al. (1989) Science 254:1 275-1281). 

s Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding 
and immunoradiometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation 
10 between the protein and its specific antibody. A * two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
two non-interfering protein epitopes are preferred, but a competitive 
. binding assay may also be employed (Maddox, supra). 

is In another embodiment of the invention, the polynucleotides encoding 
CG 14440, Gpdh, or Synaptojanin-like and homologous proteins or any 
fragment thereof or antisense molecules, may be used for therapeutic 
purposes. In one aspect, antisense molecules may be used in situations in 
whicfr. It would be, desirable to block the transcription of- the mRNA< In 

20 particular, cells may be transformed with sequences complementary to 
polynucleotides encoding CG 14440, Gpdh, or Synaptojanin-like and 
homologous proteins. Thus, antisense molecules may be used to modulate 
protein activity or to achieve regulation of gene function. Such technology 
is now well know in the art, and sense or antisense oligomers or larger 

25 fragments, can be designed from various locations along the coding or 
control regions of sequences encoding the proteins. Expression vectors 
derived from retroviruses, adenovirus, herpes or vaccinia viruses or from 
various bacterial plasmids may be used for delivery of nucleotide 
sequences to the targeted organ, tissue or cell population. Methods, which 

30 are well known to those skilled in the art, can be used to construct 
recombinant vectors, which will express antisense molecules 
complementary to the polynucleotides of the genes encoding CG 14440, 
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Gpdh, or Synaptojanin-like and homologous proteins. These techniques are 
described both in Sambrook et al. (supra) and in Ausubel et al. (supra). 
Genes encoding CGI 4440, Gpdh, or Synaptojanin-like and homologous 
proteins can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of polynucleotide which encodes 
CG 14440, Gpdh, or Synaptojanin-like and homologous proteins or 
fragments thereof. Such constructs may be used to introduce 
untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to 
transcribe RNA molecules until they are disabled by endogenous nucleases. 
Transient expression may last for a month or more with a non-replicating 
vector and even longer if appropriate replication elements are part of the 
vector system. 



As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or PNA, to the control 
regions of the genes encoding CGI 4440, Gpdh, or Synaptojanin-like and 
homologous proteins, i.e., the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, e.g., between 
positions -10 and +10 from the start site, are preferred. Similarly, 
inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing is useful because it cause inhibition of the ability of the 
double helix to open sufficiently for the binding of polymerases, 
transcription factors or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature (Gee, J. E. et al. 
(1994) In; Huber, B. E. and B. I. Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co., Mt. Kisco, N.Y.). The antisense 
molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 



Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
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sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may 
be used, include engineered hammerhead motif ribozyme molecules that 
can be specifically and efficiently catalyze endonucleolytic cleavage of 

5 sequences encoding CG1 4440, Gpdh, or Synaptojanin-Iike and homologous 
proteins. Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences: GUA, GUU, and 
GUC. Once identified, short RNA sequences of between 15 and 20 

10 ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

15 

Antisense molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. 
These include techniques for chemically synthesizing oligonucleotides such 
as solid phase phosphoramidite chemical synthesis. Alternatively, RNA 

20 molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding CG 14440, Gpdh, or Synaptojanin-Iike and homologous 
proteins. Such DNA sequences may be incorporated into a variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. 
Alternatively, these cDNA constructs that synthesize antisense RNA 

25 constitutively or inducibly can be introduced into cell lines „ cells or tissues. 
RNA molecules may be modified to increase intracellular stability and 
half-life. Possible modifications include, but are not limited to, the addition 
of flanking sequences at the 5' and/or 3' ends of the molecule or the use 
of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages 

30 within the backbone of the molecule. This concept is inherent in the 
production of PNAs and can be extended in all of these molecules by the 
inclusion of non-traditional bases such as inosine, queosine, and 
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wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily 
recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and 
equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 
vectors may be introduced into stem cells taken from the patient and 
clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection and by liposome injections may be achieved using 
methods, which are well known in the art. Any of the therapeutic methods 
described above may be applied to any suitable subject including, for 
example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, 
and most preferably, humans. 

An additional embodiment of the invention relates to the administration of 
a pharmaceutical composition, in conjunction with a pharmaceutical ly 
acceptable carrier, for any of the therapeutic effects discussed above. 
Such pharmaceutical compositions may consist of CG 14440, Gpdh, or 
Synaptojanin-like and homologous nucleic acids or proteins, antibodies to 
CG 14440, Gpdh, or Synaptojanin-like and homologous proteins, mirnetics, 
agonists, antagonists or inhibitors of CG14440, Gpdh, or Synaptojanin-like 
and homologous proteins or nucleic acids. The compositions may be 
administered alone or in combination with at least one other agent, such as 
stabilizing compound, which may be administered in any sterile, 
biocompatible pharmaceutical carrier, including, but not limited to, saline, 
buffered saline, dextrose, and water. The compositions may be 
administered to a patient alone or in combination with other agents, drugs 
or hormones. The pharmaceutical compositions utilized in this invention 
may be administered by any number of routes including, but not limited to, 
oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal,' 
enteral, topical, sublingual or rectal means. 
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In addition to the active ingredients, these pharmaceutical compositions 
may contain suitable pharmaceutically-acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the active 
compounds into preparations which, can be used pharmaceutically. Further 
details on techniques for formulation and administration may be found in 
the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). Pharmaceutical compositions for oral 
administration can be formulated using pharmaceutically acceptable carriers 
well known in the art in dosages suitable for oral administration. Such 
carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, 
and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through 
combination of active compounds with solid excipient, optionally grinding 
a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 
excipients are carbohydrate or protein fillers, such as sugars, including 
lactose, sucrose, mannitpJLor sorbitol; starch from corn, wheat, rice, potato . 
or other plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyl-cellulose or sodium carboxyrnethylcellulose; gums 
including Arabic and tragacanth; and proteins such as gelatine and 
collagen. If desired, disintegrating or solubilising agents may be added, 
such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid or a salt 
thereof, such as sodium alginate. Dragee cores may be used in conjunction 
with suitable coatings, such as concentrated sugar solutions, which may 
also contain gum Arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. Dyestuffs or pigments may be added 
to the tablets or dragee coating for product identification or to characterize 
the quantity of active compound, i.e., dosage. Pharmaceutical 
preparations, which can be used orally, include push-fit capsules made of 
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gelatine, as well as soft, sealed capsules made of gelatine and a coating, 
such as glycerol or sorbitol- Push-fit capsules can contain active 
ingredients mixed with a filler or binders, such as lactose or starches, 
lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. 
In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid or liquid polyethylene glycol with 
or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be 
formulated in aqueous solutions, preferably in physiologically compatible 
buffers such as Hanks' solution, Ringer's solution or physiologically 
buffered saline. Aqueous injection suspensions may contain substances, 
which increase the viscosity of the suspension, such as sodium 
carboxymethyl cellulose, sorbitol or dextran. Additionally, suspensions of 
the active compounds may be prepared as appropriate oily injection 
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such 
as sesame oil or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides or liposomes. Optionally, the suspension may also contain 
suitable stabilizers or agents who increase the solubility of the compounds 
to allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular 
barrier to be permeated are used in the formulation. Such penetrants are 
generally known in the art. 

The pharmaceutical compositions of the present invention may be 
manufactured in a manner that is known in the art, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. The 
pharmaceutical composition may be provided as a salt and can be formed 
with many acids, including but not limited to, hydrochloric, sulphuric, 
acetic, lactic, tartaric, malic, succinic, etc. Salts tend to be more soluble in 
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aqueous or other protonic solvents than are the corresponding free base 
forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1-50 mM histidine, 
0.1 %-2% sucrose, and 2-7% mannitol, at a pH range of 4.5 to 5.5, that is 
combined with buffer prior to use. After pharmaceutical compositions have 
been prepared, they can be placed in an appropriate container and labeled 
for treatment of an indicated condition. For administration of proteins, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include 
compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any 
compounds, the therapeutically effective does can be estimated initially 
either in cell culture assays, e.g., of preadipocyte cell lines or in animal 
models, usually mice, rabbits, dogs or pigs. The animal model may also be 
used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful 
S!pse s and routes for administration in humans. A therapeutically effective 
dose refers to that amount of active ingredient, for example CG14440, 
Gpdh, or Synaptojanin-like and homologous proteins or nucleic acids or 
fragments thereof, antibodies of CG1444-0, Gpdh, or Synaptojanin-like and 
homologous proteins, which is sufficient for treating a specific condition. 
Therapeutic efficacy and toxicity may be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., 
ED50 (the dose therapeutically effective in 50% of the population) and 
LD50 (the dose lethal to 50% of the population). The dose ratio between 
therapeutic and toxic effects is the therapeutic index, and it can be 
expressed as the ratio, LD50/ED50. Pharmaceutical compositions, which 
exhibit large therapeutic indices, are preferred. The data obtained from cell 
culture assays and animal studies is used in formulating a range of dosage 
for human use. The dosage contained in such compositions is preferably 
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within a range of circulating concentrations that include the ED50 with 
little or no toxicity. The dosage varies within this range depending upon the 
dosage from employed, sensitivity of the patient, and the route of 
administration. The exact dosage will be determined by the practitioner, in 
light of factors related to the subject that requires treatment. Dosage and 
administration are adjusted to provide sufficient levels of the active moiety 
or to maintain the desired effect. Factors, which may be taken into 
account, include the severity of the disease state, general health of the 
subject, age, weight, and gender of the subject, diet, time and frequency 
of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions 
may be administered every 3 to 4 days, every week or once every two 
weeks depending on half-life and clearance rate of the particular 
formulation. Normal dosage amounts may vary from 0.1 to 100,000 
micrograms, up to a total dose of about 1 g, depending upon the route of 
administration. Guidance as to particular dosages and methods of delivery 
is provided in the literature and generally available to practitioners in the 
art. Those skilled in the art employ different formulations for nucleotides 
than for proteins or their inhibitors. Similarly, delivery of polynucleotides or 
polypeptides will be specific to particular cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind to the proteins 
may be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or underexpression of CG 14440, Gpdh, or 
Synaptojanin-like and homologous proteins or in assays to monitor patients 
being treated with CG14440, Gpdh, or Synaptojanin-like and homologous 
proteins, agonists, antagonists or inhibitors. The antibodies useful for 
diagnostic purposes may be prepared in the same manner as those 
described above for therapeutics. Diagnostic assays include methods which 
utilize the antibody and a label to detect the protein in human body fluids 
or extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by joining them, either covalently or 
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non-covalently, with a reporter molecule. A wide variety of reporter 
molecules which are known in the art may be used several of which are 
described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
proteins are known in the art and provide a basis for diagnosing altered or 
abnormal levels of gene expression. Normal or standard values for gene 
expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, preferably human, with antibodies to the 
protein under conditions suitable for complex formation. The amount of 
standard complex formation may be quantified by various methods, but 
preferably by photometry, means. Quantities of protein expressed in 
control and disease, samples from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. 



In another embodiment of the invention, the polynucleotides specific for 
CG 14440, Gpdh, or Synaptojanin-like and homologous proteins may be 
used for diagnostic purposes. The polynucleotides, which may be used, 
include oligonucleotide sequences, antisense RNA and DNA molecules, and 
PNAs. The polynucleotides may be used to detect and quantitate gene 
expression in biopsied tissues in which gene expression may be correlated 
with disease. The diagnostic assay may be used to distinguish between 
absence, presence, and excess gene expression, and to monitor regulation 
of protein levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of 
detecting polynucleotide sequences, including genomic sequences, 
encoding CG 14440, Gpdh, or Synaptojanin-like and homologous proteins 
or closely related molecules, may be used to identify nucleic acid 
sequences which encode the respective protein. The specificity of the 
probe, whether it is made from a highly specific region, e.g., unique 
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nucleotides in the 5' regulatory region or a less specific region, e.g., 
especially in the 3' coding region, and the stringency of the hybridization or 
amplification {maximal, high, intermediate or low) will determine whether 
the probe identifies only naturally occurring sequences, alleles or related 
sequences. Probes may also be used for the detection of related 
sequences, and should preferably contain at least 50% of the nucleotides 
from any of the CG 14440, Gpdh, or Synaptojan in-like and homologous 
protein-encoding sequences- The hybridization probes of the subject 
invention may be DNA or RNA and derived from the nucleotide sequence 
of the polynucleotide encoding CG14440 (GadFly Accession Number), the 
human hypothetical protein LOC55565, Gpdh (Glycerol 3 phosphate 
dehydrogenase, GadFly Accession Number CG9042), the human soluble 
glycerol-3-phosphate dehydrogenase 1, CG7956 (GadFly Accession 
Number), or the human KIAA0966 protein or from a genomic sequence 
including promoter, enhancer elements, and introns of the naturally 
occurring gene. Means fo_r_. producing specific hybridization probes for 
DNAs encoding CG 14440, Gpdh, or Synaptojan in-like and homologous 
proteins include the cloning of nucleic acid sequences specific for 
CGJ4440, Gpdh, or Synaptojanin-like and homologous proteins into 
vectors for the production of mRNA probes. Such vectors are known in the 
art, commercially available, and may be used to synthesize RNA probes in 
vitro by means of the addition of the appropriate RISIA polymerases and the 
appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, radionuclides such as 32P or 35S 
or enzymatic labels, such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Polynucleotide sequences specific for CG 1 4440, Gpdh, or Synaptojanin-like 
and homologous nucleic acids may be used for the diagnosis of conditions 
or diseases, which are associated with the expression of the proteins. 
Examples of such conditions or diseases include, but are not limited to, 
pancreatic diseases and disorders, including diabetes. Polynucleotide 
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sequences specific for CG1444-0, Gpdh, or Synaptojanin-like and 
homologous proteins may also be used to monitor the progress of patients 
receiving treatment for pancreatic diseases and disorders, including 
diabetes. The polynucleotide sequences may be used in Southern or 
5 Northern analysis, dot blot or other membrane-based technologies; in PCR 
technologies; or in dip stick, pin, ELISA or chip assays utilizing fluids or 
tissues from patient biopsies to detect altered gene expression. Such 
qualitative or quantitative methods are well known in the art. 

10 In a particular aspect, the nucleotide sequences specific for CG 14440, 
Gpdh, or Synaptojanin-like and homologous nucleic acids may be useful in 
assays that detect activation or induction of various metabolic diseases 
such as obesity as well as related disorders such as eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 

15 hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancers of 
the reproductive organs, and sleep apnea. The nucleotide sequences may 
be labeled by standard methods, and added to a fluid or tissue sample from 
a patient under conditions suitable for the formation of hybridization 
.complexes. After.a suitable incubation period, the sample is. washed and 

20 the signal is quantitated and compared with a standard value. If the 
amount of signal in the biopsied or extracted sample is significantly altered 
from that of a comparable have hybridized with nucleotide sequences in 
the sample, and the presence of altered levels of nucleotide sequences 
encoding CG14440, Gpdh, or Synaptojanin-like and homologous proteins 

25 in the sample indicates the presence of the associated disease. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic 
treatment regimen in animal studies, in clinical trials or in monitoring the 
treatment of an individual patient. 

30 In order to provide a basis for the diagnosis of a disease associated with 
expression of CG 14440, Gpdh, or Synaptojanin-like and homologous 
proteins, a normal or standard profile for expression is established. This 
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may be accomplished by combining body fluids or cell extracts taken from 
normal subjects, either animal or human, with a sequence or a fragment 
thereof, which is specific for CG14440, Gpdh, or Synaptojan in-like and 
homologous nucleic acids, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the 
values obtained from normal subjects with those from an experiment where 
a known amount of a substantially purified polynucleotide is used. 
Standard values obtained from normal samples may be compared with 
values obtained from samples from patients who are symptomatic for 
disease. Deviation between standard and subject values is used to 
establish the presence of disease. Once disease is established and a 
treatment protocol is initiated, hybridization assays may be repeated on a 
regular basis to evaluate whether the level of expression in the patient 
begins to approximate that, which is observed in the normal patient. The 
results obtained from successive assays may be used to show the efficacy 
of treatment over a period ranging from several days to months . 



20 
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With respect to metabolic diseases such as obesity as well as related 
disorders -such as eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, dyslipidemia, 
osteoarthritis, gallstones, cancers of the reproductive organs, and sleep 
apnea the presence of a relatively high amount of transcript in biopsied 
tissue from an individual may indicate a predisposition for the development 
of the disease or may provide a means for detecting the disease prior to 
the appearance of actual clinical symptoms. A more definitive diagnosis of 
this type may allow health professionals to employ preventative measures 
or aggressive treatment earlier thereby preventing the development or 
further progression of the pancreatic diseases and disorders. Additional 
diagnostic uses for oligonucleotides desig ned from the sequences encoding 
CQ 14440, Gpdh, or Synaptojanin-like and homologous proteins may 
involve the use of PCR. Such oligomers may be chemically synthesized, 
generated enzymatically or produced from a recombinant source. Oligomers 
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will preferably consist of two nucleotide sequences, one with sense 
orientation (5'.fwdarw.3') and another with antisense (3'.rarw.5'), 
employed under optimized conditions for identification of a specific gene or 
condition. The same two oligomers, nested sets of oligomers or even a 
degenerate pool of oligomers may be employed under less stringent 
conditions for detection and/or quantification of closely related DNA or 
RNA sequences. 

Methods which may also be used to quantitate the expression of 
CG 14440, Gpdh, or Synaptojanin-like include radiolabeling or biotinylating 
nucleotides, coamplification of a control nucleic acid, and standard curves 
onto which the experimental results are interpolated (Melby, P. C. et al. 
(1 993) J. Immunol. Methods, 1 59:235-244; Duplaa, C. et al. (1 993) Anal. 
Biochem. 212:229-236). The speed of quantification of multiple samples 
may be accelerated by running the assay in an ELISA format where the 
oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantification. 

In another embodiment of the invention, the nucleic acid sequences which 
are sprecific for CG 14440, Gpdh, or Synaptojanin-like and homologous 
nucleic acids may also be used to generate hybridization probes, which are 
useful for mapping the naturally occurring genomic sequence. The 
sequences may be mapped to a particular chromosome or to a specific 
region of the chromosome using well known techniques. Such techniques 
include FISH, FACS or artificial chromosome constructions, such as yeast 
artificial chromosomes, bacterial artificial chromosomes, bacterial P1 
constructions or single chromosome cDN A libraries as reviewed in Price, C. 
M. (1993) Blood Rev. 7:127-134, and Trask, B. J. {1 991) Trends Genet. 
7:149-154. FISH (as described in Verma et al. (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York, 
N.Y.) may be correlated with other physical chromosome mapping 
techniques and genetic map data. Examples of genetic map data can be 
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found in the 1994 Genome Issue of Science (265:1 981f). Correlation 
between the location of the gene encoding CG 14440, Gpdh, or 
Synaptojanin-like on a physical chromosomal map and a specific disease or 
predisposition to a specific disease, may help to delimit the region of DNA 
associated with that genetic disease. 

The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. In situ hybridization of chromosomal preparations and physical 
mapping techniques such as linkage analysis using established 
chromosomal markers may be used for extending genetic maps. Often the 
placement of a gene on the chromosome of another mammalian species, 
such as mouse, may reveal associated markers even if the number or arm 
of a particular human chromosome is not known. New sequences can be 
assigned to chromosomal arms or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease genes 
using positional cloning or other gene discovery techniques. Once the 
disease or syndrome has been crudely localized by genetic linkage to a 
particular genomic region, for example, AT to 1 1 q22-23 (Gatti, R. A-JStal.- 
(1988) Nature 336:577-580), any sequences mapping to that area may 
represent associated or regulatory genes for further investigation. The 
nucleotide sequences of the subject invention may also be used to detect 
differences in the chromosomal location due to translocation, inversion, 
etc. among normal, carrier or affected individuals. 

In another embodiment of the invention, CG14440, Gpdh, or 
Synaptojanin-like and homologous proteins, their catalytic or immunogenic 
fragments or oligopeptides thereof, can be used for screening libraries of 
compounds, e.g. peptides or low-molecular weight organic compounds, in 
any of a variety of drug screening techniques. The fragment employed in 
such screening may be free in solution, affixed to a solid support, borne on 
a cell surface or located intracellular^. The . formation of binding 
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complexes, between CG 14440, Gpdh, or Synaptojanin-like and 
homologous proteins and the agent tested, may be measured. 

Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in published PCT application 
W084/03564-. In this method, as applied to CG 14440, Gpdh, or 
Synaptojanin-like and homologous proteins large numbers of different small 
test compounds, e.g. peptidic compounds or low-molecular weight organic 
molecules, are provided or synthesized on a solid substrate, such as plastic 
pins or some other surface. The test compounds are reacted with the 
proteins or fragments thereof, and washed. Bound proteins are then 
detected by methods well known in the art. Purified proteins can also be 
coated directly onto plates for use in the aforementioned drug screening 
techniques. Alternatively, non-neutralizing antibodies can be used to 
capture the peptide and immobilize it on a solid support. In another 
embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding the protein specifically compete 
with a .test compound for binding the protein. In this manner, the- 
antibodies can be used to detect the presence of any peptide, which 
shares one or more antigenic determinants with the protein. In additional 
embodiments, the nucleotide sequences which are specific for CG1 4440, 
Gpdh, or Synaptojanin-like and homologous nucleic acids or proteins 
encoded thereby may be used in any molecular biology techniques that 
have yet to be developed, provided the new techniques rely on properties 
of nucleotide that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

The Figures show: 

FIGURE 1 shows the increase of triglyceride content of PX10162.1 flies 
CPX101 62. T, column 2) caused by homozygous viable integration of the 
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P-vector upstream of the CG 14440 gene (in comparison to controls 
without integration of this vector, 'PX-control', column 1). 

FIGURE 2 shows the molecular organization of the mutated CG144-40 
(Gadfly Accession Number) gene locus. 

FIGURE 3 shows the BLASTP search result for the CGI 4440 gene product 
(Query) with the best human homologous match (Sbjct). 

FIGURE 4 shows the increase of triglyceride content of HD-EP(2)21 956 
CHD-EP(2)21956', column 2) flies caused by homozygous viable 
integration of the P-vector into the second intron of CG9042 (in 
comparison to controls without integration of this vector, 'EP-control', 
column 1). 

FIGURE 5 shows the molecular organization of the mutated Gpdh (Gadfly 
Accession Number CG9042) gene locus . 

FIGURE 6 shows the BLASTP search result for the CG9042 gene product 
(Query) with the best human homologous match (Sbjct). 

FIGURE 7 shows the increase of triglyceride content of HD-EP{3)31 805 
CHD-EP31805', column 2) flies caused by homozygous viable integration 
of the P-vector 3 basepairs 5' of CG 7956 (in comparison to controls 
without integration of this vector, 'EP-control', column 1). 

FIGURE 8 shows the molecular organization of the mutated CG7956 
(Gadfly Accession Number) gene locus. 

FIGURE 9 shows the BLASTP search result for the CG7956 gene product 
(Query) with the best human homologous match (Sbjct). 
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The examples illustrate the invention: 

Example 1 : Measurement of triglyceride content 

5 Mutant flies are obtained from a fly mutation stock collection. The flies are 
grown under standard conditions known to those skilled in the art. In the 
course of the experiment , additional feedings with bakers yeast 
(Saccharomyces cerevisiae) are provided for the EP-lines HD-EP(2)21 956 
and HD-EP(3)31 805. The 3verage increase of triglyceride content of 

10 Drosophila containing the EP- vectors as homozygous via ble integration was 
investigated in comparison to control flies (see FIGURES 1, 4 and 7). For 
determination of triglyceride, flies (in case of PX10162. 1 , ten flies in two 
independent assays, respectively) were incubated for 5 min at 90°C (in 
case of PX10162.1 at 70° C) in an aqueous buffer using a waterbath, 

15 followed by hot extraction. After another 5 min incubation at 90°C (in 
case of PX10162.1 at 70° C) and mild centrifugatiori, the triglyceride 
content of the flies extract was determined using Sigma Triglyceride (INT 
336-10 or -20) assay by measuring changes in the optical density 
according, M to the manufacturer's -protocol.- In. case of the EP-Iines 

20 HD-EP(2)21956 and HD-EP(3)31 805, as a reference the protein content of 
the same extract was measured using BI0-RAD DC Protein Assay 
according to the manufacturer's protocol. These assays were repeated 
three times. 

25 The average triglyceride level of about 50 lines of the PX collection 
(referred to as 'PX-control') is shown as 100% (relative amount of 
triglyceride per fly) in the first column in FIGURE 1 . The average 
triglyceride level of all flies of the EP collection (referred to as 'EP-control') 
is shown as 100% in the first columns in FIGURE 4- and 7. Standard 

30 deviations of the measurements are shown as thin bars. 




- 44- 

PX101 62.1 homozygous flies show constantly a higher triglyceride content 
than the controls (38%; column 2 in FIGURE 1, 'PX101 62.1'). Therefore, 
the loss of gene activity in the locus 6C7 on chromosome X where the 
EP-vector of PX10162.1 flies is homozygous viable integrated, is 
responsible for changes in the metabolism of the energy storage 
triglycerides, therefore representing a model for obese flies. The findings 
suggest the presence of similar functions of the homologous proteins in 
humans. 

HD-EP(2)21956 homozygous flies show constantly a higher triglyceride 
content than the controls (67%; column 2 in FIGURE 4, 'HD-EP(2)21 956'). 
Therefore, the loss of gene activity in the locus 26A8 on chromosome 2L 
where the EP-vector of HD-EP(2)21956 flies is homozygous viable 
integrated, is responsible for changes in the metabolism of the energy 
storage triglycerides. 

HD-EP(3)31805 homozygous flies show constantly a higher triglyceride 
content than the controls (45%; column 2 in FIGURE 7, 'HD-EP31805'). 
Therefore, the loss of gene activity in the locus 93E4 on chromosome 3R 
where the EP-vector of HD-EP(3)31805 flies is homozygous viable 
integrated, is responsible for changes in the metabolism of the energy 
storage triglycerides. 

Example 2: Identification of the genes 

Genomic DNA sequences were isolated that are localized to the EP vector 
(herein PX10162.1) integration. Using those isolated genomic sequences 
public databases like Berkeley Drosophila Genome Project (GadFly) were 
screened thereby confirming the homozygous viable integration site of the 
PX10162.1 vector upstream of the 5'-end of a Drosophila gene, identified 
as CG 14440 (GadFly Accession Number). FIGURE 2 shows the molecular 
organization of this gene locus. The chromosomal localization site of the 
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integration of the vector of PX10162.1 is at gene locus X, 6C7. In FIGURE 
2, genomic DNA sequence is represented by the assembly as a dotted grey 
line in the middle that includes the integration sites of vector for line 
PX10162.1. Numbers represent the coordinates of the genomic DNA 
(starting at position 6494-082 on chromosome X, ending at position 
6519082 on chromosome X). The insertion site of the P-element in 
Drosophila PX10162.1 line is shown as ' + ' on the dotted line and as 
vertical line. Dark grey boxes on the lower "cDNA"-line, linked by light grey 
boxes, represent the predicted genes (as predicted by the Berkeley 
Drosophila Genome Project, GadFIy and by Magpie). Predicted exons are 
shown as dark grey boxes, predicted introns are shown as light grey 
boxes. The gene CG14440 is labeled. Transcribed DNA sequences (ESTs) 
are shown as grey bars in the lower "EST" line. Therefore, expression of 
the cDNA encoding CGI 4440 (GadFIy Accession Number) could be 
affected by homozygous integration of vectors of line PX 1 01 62. 1 , leading 
to an increase of the energy storage triglycerides. 

Genomic DNA sequences were isolated that are localized to the EP vector 
(herein HD-EP(2)21*956) • integration. Using those isolated genomic 
sequences public databases like Berkeley Drosophila Genome Project 
(GadFIy) were screened thereby confirming the homozygous viable 
integration site of the HD-EP(2)21 956 vector into the second intron of a 
Drosophila gene in sense orientation, identified as Gpdh (GadFIy Accession 
Number CG9042), FIGURE 5 shows the molecular organization of this gene 
locus. The chromosomal localization site of the integration of the vector of 
HD-EP(2)21956 is at gene locus 2L, 26A8. In FIGURE 5, genomic DNA 
sequence is represented by the assembly as a dotted grey line in the 
middle that includes the integration sites of vector for line HD-EP(2)21 956. 
Numbers represent the coordinates of the genomic DNA (starting at 
position 5861000 on chromosome 2L, ending at position 5867250 on 
chromosome 2L), The insertion site of the P-element in Drosophila 
HD-EP(2)21 956 line is shown as triangle in the upper "P Elements" line and 
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is labeled. Dark grey boxes on the upper "cDNA"-Hne, linked by light grey 
boxes, represent the predicted genes (as predicted by the Berkeley 
Drosophila Genome Project, GadFly and by Magpie). Predicted exons are 
shown as dark grey boxes, predicted introns are shown as light grey 
boxes. The gene CG9042 is labeled. Transcribed DNA sequences (ESTs) 
are shown as grey bars in the upper "EST" line. Therefore, expression of 
the cDNA encoding Gpdh (Accession Number CG9042) could be affected 
by homozygous integ ration of vectors of line HD-EP(2)21 956, leading to an 
increase of the energy storage triglycerides. 

Genomic DNA sequences were isolated that are localized to the EP vector 
(herein HD-EP(3)31 S05) integration. Using those isolated genomic 
sequences public databases like Berkeley Drosophila Genome Project 
(GadFly) were screened thereby confirming the homozygous viable 
integration site of the HD-EP(3)31805 vector 3 base pairs 5' of a 
Drosophila gene in antisense orientation, identified as CG7956 (GadFly 
Accession Number). FIGURE 8 shows the molecular organization of this 
gene locus. The chromosomal localization site of the integration of the 
vector of HD-EP(3)31 805 is at gene locus 3R, 93E4. In FIGURE 8, genomic 
DNA sequence is represented by the assembly as a dotted grey line in the 
middle that includes the integration sites of vector for line HD-EP(3)31805 . 
Numbers represent the coordinates of the genomic DNA (starting at 
position 17260000 on chromosome 3R, ending at position 17270000 on 
chromosome 3R). The insertion site of the P-element in Drosophila 
HD-EP(3)31 805 line is shown as triangle in the lower "P Elements" line and 
is labeled. Dark grey boxes on the upper "cDNA. "-line, linked by light grey 
boxes, represent the predicted genes (as predicted by the Berkeley 
Drosophila Genome Project, GadFly and by Magpie). Predicted exons are 
shown as dark grey boxes, predicted introns are shown as light grey 
boxes. The gene CG7956 is labeled. Transcribed DNA sequences (ESTs) 
are shown as grey bars in the upper "EST" line. Therefore, expression of 
the cDNA encoding CG7956 (GadFly Accession Number) could be affected 
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by homozygous integration of vectors of line HD-EP(3)31 805, leading to an 
increase of the energy storage triglycerides. 

Example 3: Identification of human CG 14440, Gpdh, or Synaptojanin-like 
homologous proteins 

CG 14440, Gpdh, or Synaptojanin-like homologous proteins and nucleic 
acid molecules coding therefore are obtainable from insect or vertebrate 
species, e.g. mammals or birds. Particularly preferred are nucleic acids 
comprising CG1 4440 (GadFly Accession Number), the human hypothetical 
protein LOC55565, Gpdh (Glycerol 3 phosphate dehydrogenase, GadFly 
Accession Number CG9042), the human soluble glycerol-3-phosphate 
dehydrogenase 1, CG7956 (GadFly Accession Number), and the human 
KIAA0966 protein. 

• As shown in FIGURE 3, gene product of GadFly Accession Number 
CG14440 is 57% homologous to human hypothetical protein LOC55565 
(GenBank Accession Number NP_060000.1 for the protein, NM 017530 
for the cDNA). CG1444-0 also shows 58% homology on protein level to 
mouse protein similar to hypothetical protein LOC55565 (GenBank 
Accession Number AAH 231 80.1). 

As shown in FIGURE 6, gene product of GadFly Accession Number 
CG9042 is 72% homologous to human soluble glycerol-3-phosphate 
dehydrogenase 1 (GenBank Accession Number XP_006755.1 for the 
protein, XM_006755 for the cDNA). CG9042 also shows 72% homology 
on protein level to mouse cytoplasmic adult glycerophosphate 
dehydrogenase 1 (GenBank Accession Number NP_O34401 .1 ). 

As shown in FIGURE 9, gene product of GadFly Accession Number 
CG7956 is 52% homologous to human K1AA0966 protein (GenBank 
Accession Number NP 055752.1 for the protein, NM_014937 for the 
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cDNA). CG7956 also shows 57% homology on protein level to mouse 
protein ENSMUSP00000045910 (ENSEMBL Accession Number). 
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A pharmaceutical composition comprising a nucleic acid molecule of 
the CG14440, Gpdh, or Synaptojanin-like gene family or a 
polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an antibody, an aptamer 
or another receptor recognizing a nucleic acid molecule of the 
CG 14440, Gpdh, or Synaptojanin-like gene family or a polypeptide 
encoded thereby together with pharmaceutical^ acceptable carriers, 
diluents and/or adjuvants. 



The composition of claim 1, wherein the nucleic acid molecule is a 
vertebrate or insect CG 14440, Gpdh, or Synaptojanin-like nucleic 
15 acid ' Particulary encoding the human hypothetical protein 

LOC55565, the human soluble glycerol-3-phosphate dehydrogenase 
1, or the human KIAA0966 protein, and/or a nucleic molecule which 
is complementary thereto or a fragment thereof or a variant thereof. 



The composition of claim 1 or 2, wherein said nucleic acid molecule 

(a) hybridizes at 50°C in a solution containing 1 x SSC and 0. 1 % 
SDS to a nucleic acid molecule as defined in claim 2 and/or a 
nucleic acid molecule which is complementary thereto; 

(b) it is degenerate with respect to the nucleic acid molecule of 
(a), 

(c) encodes a polypeptide which is at least 85%, preferably at 
least 90%, more preferably at least 95%, more preferably at 
least 98% and up to 99,6% identical to the human 
hypothetical protein LOC55565, the human soluble 
glycerol-3-phosphate dehydrogenase 1, or the human 
KIAA0966 protein, as defined in claim 2.; 
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(d) differs from the nucleic acid molecule of (a) to (c) by mutation 
and wherein said mutation causes an alteration, deletion, 
duplication or premature stop in the encoded polypeptide. 

5 4. The composition of any one of claims 1 -3, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 

5. The composition of any one of claims 1-4, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the energy 

10 homeostasis and/or the metabolism of triglycerides. 

6. The composition of any one of claims 1 -5, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 

15 7. The composition of any one of claims 1-6, wherein the nucleic acid 
molecule is a vector, particularly an-expression vector. - 

8. The composition of any one of claims 1-5, wherein the polypeptide 
is a recombinant polypeptide. 

20 

9. The composition of claim 8, wherein said recombinant polypeptide is 
a fusion polypeptide. 

1 0. The composition of any one of claims 1 -7, wherein said nucleic acid 
2 5 molecule is selected from hybridization probes, primers and 

anti-sense oligonucleotides. 

11. The composition of any one of claims 1-10 which is a diagnostic 
composition. 



30 



12, The composition of any one of claims 1-10 which is a therapeutic 
composition. 
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The composition of any one of claims 1-12 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention of an disorders, including metabolic diseases such 
as obesity and other body-weight regulation disorders as well as 
related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, 
cancer, e.g. cancers of the reproductive organs, and sleep apnea 
and others, in cells, cell masses, organs and/or subjects. 

Use of a nucleic acid molecule of the CG 14440, Gpdh, or 
Synaptojanin-like gene family or a polypeptide encoded thereby or a 
fragment or a variant of said nucleic acid molecule or said 
polypeptide or an antibody, an aptamer or another receptor 
recognizing a nucleic acid molecule of the CG 14440, Gpdh, or 
Synaptojanin-like gene family or a polypeptide encoded thereby for 
controlling the function of a gene and/or a gene product which is 
influenced and/or modified by a CG 14440, Gpdh, or 
Synaptojanin-like homologous polypeptide. 

Use of the nucleic acid molecule of the CG14440, Gpdh, or 
Synaptojanin-like gene family or a polypeptide encoded thereby or a 
fragment or a variant of said nucleic acid molecule or said 
polypeptide or an antibody, an aptamer or another receptor 
recognizing a nucleic acid molecule of the CG 14440, Gpdh, or 
Synaptojanin-like gene family or a polypeptide encoded thereby for 
identifying substances capable of interacting with a CG14440, 
Gpdh, or Synaptojanin-like homologous polypeptide. 

A non-human transgenic animal exhibiting a modified expression of 
a CG 14440, Gpdh, or Synaptojanin-like homologous polypeptide. 
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17. The animal of claim 16, wherein the expression of the CG 14440, 
Gpdh, or Synaptojanin-like homologous polypeptide is increased 
and/or reduced. 

18. A recombinant host cell exhibiting a modified expression of a 
CG14440, Gpdh, or Synaptojanin-like homologous polypeptide. 

19. The cell of claim 18 which is a human cell. 

20. A method of identifying a (poly)peptide involved in the regulation of 
energy homeostasis and/or metabolism of triglycerides in a mammal 
comprising the steps of 

(a) contacting a collection of (poly)peptides with a CG 14440, 
Gpdh, or Synaptojanin-like homologous polypeptide or a 
fragment thereof under conditions that allow binding of said 
(poly)peptides; 

(b) removing (poly)peptides which do not bind and 

(c) identifying (poly) peptides that bind to said CG14440, Gpdh, 
_ t . 0 r Synaptojanin-like homologous polypeptide. 

21 . A method of screening for an agent which modulates the interaction 
of a CG 14440, Gpdh, or Synaptojanin-like homologous polypeptide 
with a binding target/agent, comprising the steps of 

(a) incubating a mixture comprising 



(aa) a CG 14440, Gpdh, or Synaptojanin-like homologous 
polypeptide or a fragment thereof; 

(ab) a binding target/agent of said CG 14440, Gpdh, or 
Synaptojanin-like homologous polypeptide or fragment 
thereof; and 

(ac) a candidate agent 
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under conditions whereby said CG 14440, Gpdh, or 
Synaptojanin-like polypeptide or fragment thereof specifically 
binds to said binding target/agent at a reference affinity; 

(b) detecting the binding affinity of said CGI 4440, Gpdh, or 
Synaptojanih-like polypeptide or fragment thereof to said 
binding target to determine an (candidate) agent-biased 
affinity; and 

(c) determining a difference between (candidate) agent-biased 
affinity and the reference affinity. 

A method of producing a composition comprising the (poly)peptide 
identified by the method of claim 20 or the agent identified by the 
method of claim 21 with a pharmaceutical^ acceptable carrier, 
diluent and/or adjuvant. 

The method of claim 22 wherein said composition is a 
pharmaceutical composition for preventing, alleviating or treating of 
diseases and disorders, including metabolic diseases such as obesity 
and other body-weight regulation disorders as well as related 
disorders such as eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the 
reproductive organs, and sleep apnea and other diseases and 
disorders. 

Use of a (poly)peptide as identified by the method of claim 20 or of 
an agent as identified by the method of claim 21 for the preparation 
of a pharmaceutical composition for the treatment, alleviation and/or 
prevention of of diseases and disorders, including metabolic diseases 
such as obesity and other body-weight regulation disorders as well 
as related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, 
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hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, 
cancer, e.g. cancers of the reproductive organs, and sleep apnea 
and other diseases and disorders. 



5 25. Use of a nucleic acid molecule of the CG14440, Gpdh, or 
Synaptojanin-like family or of a fragment thereof for the preparation 
of a non-human animal which over- or under-expresses the 
CG14440, Gpdh, or Synaptojanin-like gene product. 

10 26. Kit comprising at least one of 

(a) a CG 14440, Gpdh, or Synaptojanin-like nucleic acid molecule 
or a fragment thereof; 

(b) a vector comprising the nucleic acid of (a); 

(c) a host cell comprising the nucleic acid of (a) or the vector of 
15 (b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the 
nucleic acid of (a) or the polypeptide of (d) or "(e) and 

20 (g) an anti-sense oligonucleotide of the nucleic acid of (a). 
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Abstract V> % 



The present invention discloses CG 14440, Gpdh, or Synaptojanin-like 
s homologous proteins regulating the energy homeostasis and tho 
metabolism of triglycerides, and polynucleotides, which identify and 
encode the proteins disclosed in this invention. The invention also relates 
to the use of these sequences in the diagnosis, study, prevention, and 
treatment of diseases and disorders, for example, but not limited to, 
10 metabolic diseases such as obesity as well as related disorders such as 
eating disorder, cachexia, diabetes mellitus, hypertension, coronary heart 
disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, 
cancers of the reproductive organs, and sleep apnea. 

15 
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FIGURE 2. Molecular organization of the CG14440 gene (GadFly Accession 
Number) 
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FIGURE 3. BLASTP results for CG14440 (GadFly Accession Number) 
Homology to human protein NPJ)60000.1 (GenBank Accession Number) 

Le^gS^ 6 ??? 0 ^' (NM -° 17530) hypothetical protein LOC55565 [Homo sapiens] 

Score = 77.4 bits (189), Expect = 2e~13 

Identities = 41/106 (38%), Positives = 62/106 (57%) 

Query: 195 QGQSSRAQKAARRRSNESIEARERRLERNA^ 2 54 

+ Q+ +K A RR NE +E R +RLER + +R E+ E VR ++ EA R++R 

Strjct: 207 EAQTPSVRKWALRRQNEPLEVRLQRLERERT^ 266 

Query: 255 QNETEVQRTLRLMKNAARQRLRRASETVEERKKRLAKAAERMRIAR 300 

ET+ ++ RL+RA+ET E+R+ RL + E R+ R 

Sbjct: 267 MQETDEQRARRLQRDREAMRLKRANETPEKRQARLIREREAKRLKR 312 



FIGURE 4. Triglyceride content of a Drosophila Gpdh (GadFly Accession 
Number CG9042) mutant 




FIGURE 5. Molecular organisation of the Gpdh gene (GadFly Accession 
Number CG9042) 

CG9042= Glycerol 3 phosphate dehydrogenase 
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FIGURE 6. BLASTP results for CG9042 (GadFly Accession Number) 



Homology to human protein XP_006755.1 (GenBank Accession Number) 

gi 1 11440382 | ref | XP_J)06755 . 1 1 (XM.006755) glycerol -3 -phosphate dehydrogenase 1 
(soluble) [Homo sapiens] Length = 349 

Score = 423 bits (1087), Expect « e-118 

Identities = 217/351 (61%), Positives = 256/351 (72%), Gaps = 3/351 (0%) 

Query: 1 MADKVTWCIVGSGNWGSXXXXXXXX 60 

MA K VCIVGSGNWGS- L +F+ RVTM+V+EE I GKKLTEIIN HE 

Sbjct : 1 MASK-KVCI VGSGNWGSAI AKIVGGNAAQLAQFDPRVTMWVFEEDIGGKKLTEI INTQHE. . 59. 

Query: 61 NvlC^KGHKLPPNVVAVPDLVEAAKNADILIFVVPHQF I PNFCKQLLGKIKPNAI AISLI 120 

NVKYL GHKLPPNWAVPD+V+AA++ADILIFWPHQFI C QL G +K NA ISLI 
Sbjct: 60 NVKYLPGHKLP PNWAVPDWQAAED AD ILI F WPHQF IGKI CDQLKGHLKANATGI SLI 119 

Query: 121 KGFDKAEGGGIDLISHIITRHLKIPCAVXMGANLANEVAE 180 

KG D+ G+ LIS +1 L IP +VLMGAN+A+EVA+ FCETTIGC D G++L+ 

Sbjct: 120 KGVDEGP-NGLKLISEVIGERLGIPMSVLMGANIASEVADEKFCETTIGCKDPAQGQLLK 17 8 

Query: 181 DLFQANHFRXXXXXXXXXXXXCGALKNIVACGAGFVT)GLKLGDNTKAAVIRLGLMEMIRF 240 

+L Q +FR CGALKN+VA GAGF DGL GDNTKAAVIRLGLMEMI F 

Sbjct: 179 ELMQTPNFRITVVQEVTDTV^ICGALKNVVAVGAGFCDGLGF 238 

Query: 241 TOVTYPGSKLS-TFFESCGVADLITTCYGGRNRRVSEAFVTSGKTIEELEKEMLNGQKLQ 299 

+F G S TF ESCGVADLITTCYGGRNR+V+EAF +GK+ IE+LEKE+LNGQKLQ 
Sbjct: 239 AKLFC SGPVSS ATFLE SCGVADLITTCYGGRNRKVAEAF ARTGKS I EQLEKELLNGQKLQ 298 



Query: 300 GPPTAEEVNYMLKNKGLEDKF PLFTAIHKICTNQLKPNDLIDC IRNHPEHM 350 

GP TA E+ +L++KGL DKFPLF A++K+C + I C++NHPEHM 

Sbjct: 299 GPETARELYSILQHKGLVDKFPLFMAVYKVCYEGQPVGEFIHCLQNHPEHM 349 



FIGURE 7. Triglyceride content of a Drosophila CG7956 (GadFly Accession 
Number) mutant 




FIGURE 8. Molecular organization of the CG7956 gene (GadFly Accession 
Number) 
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FIGURE 9. BLASTP results for CG7956 (GadFly Accession Number) 
Homology to human protein NP_055752.1 (Gen Bank Accession Number) 

L^ngS"! 5 ""' 1 ' (N **- 014937 > KIAA0966 protein [Homo sapiens] 
Score = 573 bits (1477), Expect = e-162 

Identities = 354/972 (36%), Positives = 514/972 (52%), Gaps = 114/972 (11%, 

Query: 1 ^QTDS^IFVKRDKSLWWHRRTSEFSIKAGWDLSSVTDDXECIGVTHGIVGVISLPW 60 
ME+FQ HYI + +++LW RR ++ DL + C+G+ G++r T r ™ " 

SbDCt: 1 MELFQAKDHVI LQQGERALWC SRRDGGLQLRPATDLLL AWNP ICLGL VEGVIGKIQ.LHSD 60 ■ 

Query: 61 YEPHI.VWKEASAVGVLYPPHLVYKIKSICILSADD PDTDLPNCTKHTKSNQSTPTH 117 

+ VG L H V K+ I +LS + D +L, C KH ° WU1>1 *" 1 117 
Sb^ct: 61 LPWWLILIRQKALVGKLPGDHEVCKVTKIAVLSLSEMEPQDLELEDCKKH no 

Query: 118 SVSTSNNNHASVPSSGGGSSKSTKLFEGMNKTWGAVKSAGNT IKNTTQQAANLATKQ 174 

Sbicfc- 111 _ G+HK + S ++ T +N++ 

" HFGXNKPEKI IPSPDDSKFLLKTFTHIK SNVSAPN 145 

Query: 175 VKSSVGIREPRHIERRITEELHKXFDETDSFYFSFDCDITNNLQRHEAKSEESQ SQP 231 

. K +E +ERR+ EEL K+F +++SFY+S D+TN++QR + + + 

Sb 3 ct: 146 KKKVKESKEKEKLERRLLEELI^MFMDSESFYYSLTMJLTNSVQRQSTGERDGRPLWQKV 205 



Query: 232 DERFFWNKHMIRDLINLNDKT WILPIIQGFMQVEHCVIG— —— — __ , 7n 

•_. . t „„ D+RFFWNK+M1+DI, + WI+P+IQGF+Q+E V+ " 

Sbjct: 20o DDRFFWNKYMIQDLTEIGTPDVDFWIIPMIQGFVQIEELWNYTESSDDEKSSPETPPQF — 



Query: 271 NEC FTIiALVSRRSRHRAGTRYKRRGVDEKGNCANYVETEQILSFRHHQLSFTQ 323 

cj^^h. ,« ZJL P +AL+SRRSRHRAG RYKRRGVD+ GN AWYVETEQ++ +H LSF O 

Sb.ct: 266 STCVDDIHPRFLVALISRRSRHRAGMRYKRRG^ 325 

Query: 324 VRGSWIYWSQPGYKYRPPPRUJRGVAETQQAFELHFTKELETYGRVCIVNLVEQSGKEK 383 

RGSVP+ +WSQ GY+Y P PRLDR ET F HF + +X. y + X+NLV+O+G+EK 
Sb3ct: 326 TRGSVPVFWSQVGYRYNPRPRr^RSEKETVAYFCAHFEEQr^IYKKQV^I^ 3 85 

Query: 3 84 ^^^^^^^^^^IKl^roRLIYVTFDFHDYCRGMRFEWSAIjIDAVGPEAGAMGFHWRDQRG 443 
Sh^,. T 5°° A * V+ NN L YV+FDFH++CRGM+FENV L DA+ M + W D+ G 

Sb 3 Cfc: 386 I IGDAYLKQVLLFNNSHLTYVSFDFHEHCRGMKFENVQTIjTDAX YDI ILDMKWCWVDE AG 445 

Query: 444. '^CNQKSWRVNC^CLDRTNWQTAIGKAVI.ESQIiVKLGLSPPYTPIPEQIiKSPFMVLW 503 

+IC Q+ +FRVNCMDCLDRTNWQ AX + V+E QL Kt,C5+ PP p+p + + ++w 
Sb 3 ct: 446 VICKQEGIFRVNCMDCLDRTM\A/QAAIARVVMEQQLKKI,C3VMPPEQPI,PVKCNRIYQIMW 505 

Query: 504 ANNGDIISRQYAGTNALKGDYTRTGERKISG> ! MKDGMNS^^YYLARFKDSYRQATXDI^ 563 

sb-ict. sn« ™L™ YAGT alkgd+t Rtgerk ++ g + mkdg + wsanryyl rfkd+yrqa IDLM 

Sbjct: 506 AlWGDSISRQYAGTAALKGDFTRTGERKIAGV>DaDGVNSA^YYLNRFKDAYRQAVrDLM 565 

Query: 564 LGNQVSSESLSALGGQAGPD ENDGTENAEQAKLLVED CRRLLLGTAQYPVG AWGLID 620 

Sbjct: 566 QGIPVTEDLYSIFTKEKEHEALHl^NQRSHQELISQLLQ 625 

Query: 621 ADPSSGDINETE^ILLLTDDCYIVAEYDSHLDKIVRFEKVQLTQVRL 680 

- J? PS ° *VD Y VA YD +DK+ ++-»-++ L + IE+G + + 

Sbjct: 626 roPSLIDATHRDVDVLLLLSNSAYYVAYYDDEX/DKVNQYQRLSLENLEKIEIG — PEPTI, 683 

Query: 681 FQGSAPAHLCI.R^SvDEQEGYFHMFRSANLR^ 740 
Sbjct: 684 F--GKPKFSC^RLHYRYKEASgSSti1 AV^PEEDG^QC^AeIS 731 



Query: 7 41 IALDNAGNTEVRYITGGVLQRRKSKLPTLDV PRGMPRNL SE SQLVQLS SKA 791 

I G+ I L+R+ SK P D+ +N S+ K 

Sbjct: 732 ITKQAMGSD— LPIIEKKLERKSSK-PHEDIIGIRSQNQGSLAQGKNFLMSKFSSLNQKV 788 

Query: 7 92 LSNMA GQFSKLGQTPKKPQAHPSSLAATMNPQVMRQRDSEIESGQEAEKAVFTLGR 847 

+ G KLG P KP+ + L + + + DS +E+ + V + 

Sbjct: 7 89 KQTKS3^IVNIGNLRKLG-NFTKPEMKVNFLKP^ILKVNLWKS -DSSLETMENT- -GVMDKVQ 844 

Query: 8 48 KHRNSNSAS STDTDEHDNSLYEPEVDSDVE IAMDKSNYNE -NAFLPSVGIVMG NQK 902 

+ + +S D+ D L + D D ++A + + LPS GI+ + 

Sbjct: 8 45 AESDGDMSSDNDSYHSDEFLTNSKSDEDRQLANSIiESVGPIDYVLPSCGIIASAPRLGSR 904 



Query: 903 EDSPSSSDEIRH 914 

S SS+D H 
Sbjct: 905 SQSLSSTDSSVH 916 
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